Updated: April 29th, 2026
Overview
Yale’s Data-Intensive Social Science Center (DISSC) was founded in 2022 to address a growing need for coordinated support for data-intensive social science research. DISSC serving as a central research support hub, expanding access to computational resources, data science tools, and statistical methods for social science researchers across the university 1.
They assist colleagues with data acquisition, access, and storage, and maintain a listing of new data licenses acquired for the Yale community or publicly available in partnership with the Yale Library. Their site also includes links to additional data sources made available through other groups and sources. Their Discover and Access High-Value Social Science Data webpage includes links to additional data sources made available through other groups, alongside a full listing of DISSC-managed data acquisitions. At the time of this writing, current offerings include 2:
Census Data: Individual-level and household microdata from US decennial census records (1870–1950), available through Integrated Public Microdata Series (IPUMS).
Consumer and Spending Data: Multiple datasets covering US consumer demographics, spending, and purchasing behavior, including longitudinal consumer history (Infutor), retail and product-level sales data (Kilts-Nielsen), de-identified credit and debit card transactions since 2019 (MBHS3), and multi-channel consumer panel data covering both e-commerce and in-store purchases since 2017 (Numerator).
Health and Pharmacy Data: Drug formulary and prior authorization data covering pharmacy and medical benefit coverage across thousands of US health plans, pharmacy benefit managers, and payers, spanning 2017–2022, from Managed Markets Insight & Technology (MMIT).
Gaining Access
Do I Qualify?
DISSC advertises newly released publicly available data and maintains a curated listing of licensed datasets acquired for the Yale community. Access to some datasets may be restricted based on discipline or intended use, with restrictions varying across datasets.
Typical Timeline
Licenses have already been established for qualifying researchers. Review and approval processes for gaining access may vary depending on the researcher’s intended use and current discipline.
Step-by-Step Guide
Steps to gaining access vary across DISSC-curated datasets and depend on the researcher’s intended use and discipline. Please review the documentation on DISSC’s data resources site or contact DISSC directly for further guidance.
Relevant Links
Data-Intensive Social Science Center (DISSC): Find information about DISSC’s resources and services.
Discover and Access High-Value Social Science Data: Find DISSC’s curated datasets acquired for social science research 2.
Publications
This section presents a selection of PubMed articles that utilize one of the curated datasets and are authored by individuals affiliated with the Yale University. These articles are provided to inspire researchers and students to use the data in their own work.
-
E-cigarette Marketing Expenditures in the United States From 2016 to 2021: Targeted Media Outlets Geared Toward People Who Are at Increased Risk for Tobacco Use.
Jenny E Ozga, Andrea M Stroup, Melissa H Abadi, Marshall K Cheney, Anuja Majmundar, Kathleen A Garrison, Julia Chen-Sankey, Steve Shamblen, Christopher Dunlap, Cassandra A Stanton
Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco doi: 10.1093/ntr/ntac209
PMID: 36070398